NSF PAR Search | NSF Public Access Repository

Fast Chip Transient Temperature Simulation via Machine Learning

https://doi.org/10.1109/MLCAD65511.2025.11189189

Hajikhodaverdian, Mohammadamin; Reda, Sherief; Coskun, Ayse K (September 2025, IEEE)

With growing transistor densities, analyzing temperature in 2D and 3D integrated circuits (ICs) is becoming more complicated and critical. Finite-element solvers give accurate results, but a single transient run can take hours or even days. Compact thermal models (CTMs) shorten the temperature simulation running time using a numerical solver based on the duality between thermal and electric properties. However, CTM solvers often still take hours for small-scale chips because of iterative numerical solvers. Recent work using machine learning (ML) models creates a fast and reliable framework for predicting temperature. However, current ML models demand large input samples and hours of GPU training to reach acceptable accuracy. To overcome the challenges stated, we design an ML framework that couples with CTMs to accelerate steady-state and transient thermal analysis without large data inputs. Our framework combines principal-component analysis (PCA) with closed-form linear regression to predict the on-chip temperature directly. The linear regression weights are solved analytically, so training for a grid size of 512 × 512 finishes in under a minute with only 15–20 CTM samples. Experimental results show that our framework can achieve more than 33× and 49.6× speedup for steady-state and transient simulation of a chip with a 245.95 mm^2 footprint, keeping the mean squared error below 0.1 °C^2.
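As an illustration only, the general idea of pairing PCA with an analytically solved linear regression can be sketched in NumPy. This is not the authors' implementation: the synthetic linear stand-in for CTM output, the function names fit_pca_linear and predict, the number of power blocks, the grid size, and the ambient temperature are all assumptions made for this sketch.

```python
import numpy as np

def fit_pca_linear(P, T, k):
    """Fit PCA on temperature maps, then regress PCA coefficients on power inputs.

    P: (n_samples, n_inputs) power features (hypothetical per-block powers).
    T: (n_samples, n_cells) flattened temperature maps from a reference solver.
    k: number of principal components to keep.
    """
    t_mean = T.mean(axis=0)
    # PCA via SVD of the centered temperature snapshots.
    _, _, Vt = np.linalg.svd(T - t_mean, full_matrices=False)
    components = Vt[:k]                            # (k, n_cells)
    Z = (T - t_mean) @ components.T                # (n_samples, k) PCA coefficients
    X = np.hstack([P, np.ones((P.shape[0], 1))])   # append a bias column
    # Least-squares weights, solved directly with no iterative training loop.
    W, *_ = np.linalg.lstsq(X, Z, rcond=None)      # (n_inputs + 1, k)
    return t_mean, components, W

def predict(P, model):
    """Map power inputs to full temperature maps through the reduced space."""
    t_mean, components, W = model
    X = np.hstack([P, np.ones((P.shape[0], 1))])
    return X @ W @ components + t_mean

# Synthetic stand-in for CTM samples (assumption: temperature is linear in power).
rng = np.random.default_rng(0)
n_blocks, n_cells = 6, 16 * 16
A = rng.random((n_blocks, n_cells))                # hypothetical thermal response
P_train = rng.random((20, n_blocks))               # 20 training samples
T_train = 45.0 + P_train @ A                       # 45.0 is an assumed ambient value

model = fit_pca_linear(P_train, T_train, k=n_blocks)

P_test = rng.random((5, n_blocks))
T_true = 45.0 + P_test @ A
T_pred = predict(P_test, model)
mse = float(np.mean((T_pred - T_true) ** 2))
print(f"test MSE: {mse:.3e}")
```

Because the synthetic data are exactly linear in the power inputs, this toy case is reproduced almost perfectly; real CTM output would leave some residual error, and the number of components k would have to be chosen against that error.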
Free, publicly-accessible full text available September 8, 2026
Fast Machine Learning Based Prediction for Temperature Simulation Using Compact Models

https://doi.org/10.23919/DATE64628.2025.10993231

Hajikhodaverdian, Mohammadamin; Reda, Sherief; Coskun, Ayse K (March 2025, IEEE)

As transistor densities increase, managing thermal challenges in 3D IC designs becomes more complex. Traditional methods like finite element methods and compact thermal models (CTMs) are computationally expensive, while existing machine learning (ML) models require large datasets and a long training time. To address these challenges with the ML models, we introduce a novel ML framework that integrates with CTMs to accelerate steady-state thermal simulations without needing large datasets. Our approach achieves up to 70× speedup over state-of-the-art simulators, enabling real-time, high-resolution thermal simulations for 2D and 3D IC designs.
Free, publicly-accessible full text available March 31, 2026
Distributed Economic Dispatch in Power Networks Incorporating Data Center Flexibility

https://doi.org/10.1109/TSUSC.2025.3538817

Tsiligkaridis, Athanasios; Andrianesis, Panagiotis; Coskun, Ayse K; Caramanis, Michael C; Paschalidis, Ioannis Ch (January 2025, IEEE Transactions on Sustainable Computing)

Full Text Available
Conductor: A Collaboration Framework for Multi-Data-Center Demand Response

https://doi.org/10.1109/IGSC64514.2024.00013

Acun, Fatih; Paschalidis, Ioannis Ch; Coskun, Ayse K (November 2024, IEEE)

Full Text Available
LLMs Cannot Reliably Identify and Reason About Security Vulnerabilities (Yet?): A Comprehensive Evaluation, Framework, and Benchmarks

https://doi.org/10.1109/SP54263.2024.00210

Ullah, Saad; Han, Mingji; Pujar, Saurabh; Pearce, Hammond; Coskun, Ayse; Stringhini, Gianluca (May 2024, IEEE)

Full Text Available
Data Center and Load Aggregator Coordination Towards Electricity Demand Response

https://doi.org/10.1016/j.suscom.2024.100957

Zhang, Yijia; Tsiligkaridis, Athanasios; Paschalidis, Ioannis Ch.; Coskun, Ayse K. (January 2024, Sustainable Computing: Informatics and Systems)

Full Text Available
An End-to-End HPC Framework for Dynamic Power Objectives

https://doi.org/10.1145/3624062.3624262

Wilson, Daniel C.; Acun, Fatih; Jana, Siddhartha; Ardanaz, Federico; Eastep, Jonathan M.; Paschalidis, Ioannis Ch.; Coskun, Ayse K. (November 2023, ACM)

Full Text Available
Processing-in-Memory using Optically-Addressed Phase Change Memory

https://doi.org/10.1109/ISLPED58423.2023.10244409

Yang, Guowei; Demirkiran, Cansu; Kizilates, Zeynep; Ocampo, Carlos; Coskun, Ayse; Joshi, Ajay (August 2023, ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED) 2023)

Today’s Deep Neural Network (DNN) inference systems contain hundreds of billions of parameters, resulting in significant latency and energy overheads during inference due to frequent data transfers between compute and memory units. Processing-in-Memory (PiM) has emerged as a viable solution to tackle this problem by avoiding the expensive data movement. PiM approaches based on electrical devices suffer from throughput and energy efficiency issues. In contrast, Optically-addressed Phase Change Memory (OPCM) operates with light and achieves much higher throughput and energy efficiency compared to its electrical counterparts. This paper introduces a system-level design that takes the OPCM programming overhead into consideration, and identifies that the programming cost dominates the DNN inference on OPCM-based PiM architectures. We explore the design space of this system and identify the most energy-efficient OPCM array size and batch size. We propose a novel thresholding and reordering technique on the weight blocks to further reduce the programming overhead. Combining these optimizations, our approach achieves up to 65.2x higher throughput than existing photonic accelerators for practical DNN workloads.
Full Text Available
TREAD-M3D: Temperature-Aware DNN Accelerators for Monolithic 3D Mobile Systems

https://doi.org/10.1109/TCAD.2023.3285039

Shukla, Prachi; Pavlidis, Vasilis F.; Salman, Emre; Coskun, Ayse K. (January 2023, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

Full Text Available
Architecting Optically Controlled Phase Change Memory

https://doi.org/10.1145/3533252

Narayan, Aditya; Thonnart, Yvain; Vivet, Pascal; Coskun, Ayse; Joshi, Ajay (December 2022, ACM Transactions on Architecture and Code Optimization)

Phase Change Memory (PCM) is an attractive candidate for main memory, as it offers non-volatility and zero leakage power while providing higher cell densities, longer data retention time, and higher capacity scaling compared to DRAM. In PCM, data is stored in the crystalline or amorphous state of the phase change material. The typical electrically controlled PCM (EPCM), however, suffers from longer write latency and higher write energy compared to DRAM and limited multi-level cell (MLC) capacities. These challenges limit the performance of data-intensive applications running on computing systems with EPCMs. Recently, researchers demonstrated optically controlled PCM (OPCM) cells with support for 5 bits/cell in contrast to 2 bits/cell in EPCM. These OPCM cells can be accessed directly with optical signals that are multiplexed in high-bandwidth-density silicon-photonic links. The higher MLC capacity in OPCM and the direct cell access using optical signals enable an increased read/write throughput and lower energy per access than EPCM. However, due to the direct cell access using optical signals, OPCM systems cannot be designed using conventional memory architecture. We need a complete redesign of the memory architecture that is tailored to the properties of OPCM technology. This article presents the design of a unified network and main memory system called COSMOS that combines OPCM and silicon-photonic links to achieve high memory throughput. COSMOS is composed of a hierarchical multi-banked OPCM array with novel read and write access protocols. COSMOS uses an Electrical-Optical-Electrical (E-O-E) control unit to map standard DRAM read/write commands (sent in electrical domain) from the memory controller on to optical signals that access the OPCM cells. Our evaluation of a 2.5D-integrated system containing a processor and COSMOS demonstrates 2.14× average speedup across graph and HPC workloads compared to an EPCM system. COSMOS consumes 3.8× lower read energy-per-bit and 5.97× lower write energy-per-bit compared to EPCM. COSMOS is the first non-volatile memory that provides comparable performance and energy consumption as DDR5 in addition to increased bit density, higher area efficiency, and improved scalability.
Full Text Available
